A Semi-Parametric Multiple Imputation Data Augmentation Procedure
نویسنده
چکیده
Multiple imputation procedures (MI) are a useful tool to adjust for item non-response but are often based on fully parametric assumptions, such as multivariate normality. For many applications such assumptions may not hold in practice, for example if the data are skewed and affected by rounding and truncation effects. Hot deck imputation methods, however, make less or no assumptions about underlying distributions and may be more appropriate to use in such circumstances. The basic idea is to combine multiple imputation and hot deck approaches with the aim of preserving advantageous properties of MI and at the same time relaxing distributional assumptions. This paper develops a semi-parametric data augmentation method to generate MI under the missing at random assumption and a nonignorable missing data mechanism. The use of predictive mean matching as a form of hot deck imputation is considered as part of the data augmentation procedure to improve the robustness of the multiple imputation method.
منابع مشابه
A multiple imputation approach to linear regression with clustered censored data.
We extend Wei and Tanner's (1991) multiple imputation approach in semi-parametric linear regression for univariate censored data to clustered censored data. The main idea is to iterate the following two steps: 1) using the data augmentation to impute for censored failure times; 2) fitting a linear model with imputed complete data, which takes into consideration of clustering among failure times...
متن کاملEstimating Semi-Parametric Missing Values with Iterative Imputation
In this paper, the author designs an efficient method for imputing iteratively missing target values with semi-parametric kernel regression imputation, known as the semi-parametric iterative imputation algorithm (SIIA). While there is little prior knowledge on the datasets, the proposed iterative imputation method, which impute each missing value several times until the algorithms converges in ...
متن کاملImputation of Missing Data for the Pre-Elementary Education Longitudinal Study
In the Pre-Elementary Education Longitudinal Study (PEELS), imputation of item missing data was done using AutoImpute (AI) software, which uses semi-parametric modeling to form imputation classes. In this paper, we summarize PEELS experience with AI, investigate the bias aspect of the imputed data for the PEELS teacher questionnaire data, and study the variance estimation of imputed data using ...
متن کاملMissing Values with iterative imputation
In this paper, the author designs an efficient method for imputing iteratively missing target values with semiparametric kernel regression imputation, known as the semi-parametric iterative imputation algorithm (SIIA). While there is little prior knowledge on the datasets, the proposed iterative imputation method, which impute each missing value several times until the algorithms converges in e...
متن کاملAccuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)
Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...
متن کامل